Ontology-based data quality framework for data stream applications

نویسندگان

  • Sandra Geisler
  • Sven Weber
  • Christoph Quix
چکیده

Data Stream Management Systems (DSMS) have been proposed to address the challenges of applications which produce continuous, rapid streams of data that have to be processed in real-time. Data quality (DQ) plays an important role in DSMS as there is usually a trade-off between accuracy and consistency on the one hand, and timeliness and completeness on the other hand. Previous work on data quality in DSMS has focused only on specific aspects of DQ. In this paper, we present a flexible, holistic ontology-based data quality framework for data stream applications. Our DQ model is based on a threefold notion of DQ. First, content-based evaluation of DQ uses semantic rules which can be userdefined in an extensible ontology. Second, query-based evaluation adds DQ information to the query results and updates it while queries are being processed. Third, the application-based evaluation can use any kind of function which computes an application-specific DQ value. The whole DQ process is driven by the metadata managed in an ontology which provides a semantically clear definition of the DQ features of the DSMS. The evaluation of our approach in two case studies in the domain of traffic information systems has shown that our framework provides the required flexibility, extensibility, and performance for DQ management in DSMS.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proceedings of the 16th International Conference on Information Quality, ICIQ 2011, Adelaide, Australia, November 18-20, 2011

s for Keynotes A Practitioner's View of the Really Big Data Quality Research Issues 12 What Does the Next Generation of Business Models Mean for Information Quality? 35 Cloud Computing and Data Quality Services 43 Employing ISO9001 to Improve Water Information Quality in New South Wales 58 Data Quality in Shell: Building IQ Knowledge and Skills 88 A Journey Towards Enhanced Data Quality in Heal...

متن کامل

Stream Querying for Private Data

Due to the dynamic nature of knowledge and data in semantic applications, i.e., ontology stream querying technologies are essential for knowledge driven data exploitation systems. Nowadays, many proposed stream reasoning solutions and implemented systems apply forward chaining completion algorithms to handle the removal and addition of axioms. In this deliverable, we propose a novel approach to...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Towards collaborative data reduction in stream-processing systems

We consider a distributed system that disseminates high-volume event streams to many simultaneous monitoring applications over a low-bandwidth network. For bandwidth efficiency, we propose a collaborative data-reduction mechanism, ‘group-aware stream filtering’, used together with multicast, to select a small set of necessary data that satisfy the needs of a group of subscribers simultaneously....

متن کامل

Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information

With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011